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Abstract. In this paper we will present the two basic operations for database 
schemas used in database mapping systems (separation and Data Federation), 
and we will explain why the functorial semantics for database mappings needed 
a new base category instead of usual Set category. 

Successively, it is presented a definition of the graph G for a schema database 
mapping system, and the definition of its sketch category Sch(G). Based on this 
framework we presented functorial semantics for database mapping systems with 
the new base category DB. 



1 Introduction 

Most work in the data integration/exchange and P2P framework is based on a logical 
point of view (particularly for the integrity constraints, in order to define the right mod- 
els for certain answers) in a 'local' mode (source-to-target database), where a general 
'global' problem of a composition of complex partial mappings that involves a number 
of databases has not been given the correct attention. 

This work is an attempt to give a correct solution for a general problem of complex 
database-mappings and for high level algebra operators for database schemas (separa- 
tion, Data Federation), preserving the traditional common practice logical language for 
schema database mapping definitions. 

Only a few works considered this general problem 111 1213141 . One of them, which uses a 
category theory J2], is too restrictive: their institutions can be applied only for inclusion 
mappings between databases. 

There is a lot of work for sketch-based denotational semantics for databases I5I6I7I8I . 
But all of them use, as objects of a sketch category, the elements of an ER-scheme of 
a database (relations, attributes, etc..) and not the whole database as a single object, 
which is what we need in a framework of inter-databases mappings. It was shown in |9 | 
that if we want to progress to more expressive sketches w.r.t. the original Ehresmann's 
sketches for diagrams with limits and coproducts, by eliminating non-database objects 
as, for example, cartesian products of attributes or powers et objects, we need more ex- 
pressive arrows for sketch categories (diagram predicates in ||9] that are analog to the 
approach of Makkai in ifTOl ). Obviously, when we progress to a more abstract vision 
where objects are the (whole) databases, following the approach of Makkai, in the new 
basic category DB for databases, where objects are just the database instances (each 
object is a set of relations that compose this database instance), we obtained much 



more complex arrows. Such arrows are not simple functions, as in the case of base 
Set category, but complex trees (operads) of view-based mappings. In this way, while 
Ehresmann's approach prefers to deal with few a fixed diagram properties (commuta- 
tivity, (co)limitness), we enjoy the possibility of setting full relational-algebra signature 
of diagram properties. 

This work is an attempt to give a correct solution for this problem while preserving the 
traditional common practice logical language for the schema database mapping defini- 
tions. 

The instance level base database category DB has been introduced first time in Techni- 
cal report lUTl . and used also in JT2). General information about categories the reader 
can find in classic books ifTJl . while more information about this particular database 
category DB, with set of its objects Obus and set of its morphisms Motdb, are re- 
cently presented in 1141 . In this paper we will only emphasize some of basic properties 
of this DB category, in order to render more selfcontained this presentation. 
Every object (denoted by A, B, C,..) of this category is a database instance, composed 
by a set of n-ary relations a* £ A, i = 1, 2, ... called also "elements of A". 
We consider the views as a universal property for databases: they are the possible ob- 
servations of the information contained in an instance-database, and we may use them 
in order to establish an equivalence relation between databases. 

In ifTTIl has been defined the power-view operator T, with domain and codomain equal 
to the set of all database instances, such that for any object (database) A, the object TA 
denotes a database composed by the set of all views of A. The object TA, for a given 
database instance A, corresponds to the quotient-term algebra La/rj, where carrier is 
a set of equivalence classes of closed terms of a well defined formulae of a relational 
algebra, "constructed" by -constructors (relational operators in SPJRU algebra: se- 
lect, project, join and union) and symbols (attributes of relations) of a database instance 
A, and constants of attribute-domains. 

Different properties of the base DB category are considered in a number of previously 
published papers II 1 51 1 61 1 71 1 81 1 91 as well, where this basic power-view operator T is 
extended to the endofunctor T : DB — > DB. 

The connection between a logical (schema) level and this computational category is 
based on the interpretation functors. Thus, each rule-based conjunctive query at schema 
level over a database A will be translated (by an interpretation functor) in a morphism 
in DB, from an instance-database A (a model of the database schema A) to the instance- 
database TA composed by all views of A. 

1.1 Basic Database concepts 

The database mappings, for a given logical language (for default we assume the First- 
Order Language (FOL)), are defined usually at a schema level (iri, TT2 denote first and 
second projections, 1+J disjoint union, and h! the set of natural numbers), as follows: 

- A database schema is a pair A = (Sa, 2Ja) where: Sa = tti (.4) is a countable set 
of relation symbols r G R, ar : R — >• N, with finite arity (finite list of attributes 
x =< xx,...,x n >,n — ar(r) > 1), disjoint from a countable infinite set att 
of attributes (for any single attribute x G att a domain of a; is a nonempty subset 
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dom(x) of a countable set of individual symbols dom, disjoint from att ), such that 
for any r S R, the sort of R is a finite sequence of elements of att. Sa = ^2 (A) 
denotes a set of closed formulas (without free variables) called integrity constraints, 
of the sorted First-Order Language (FOL) with sorts att, constant symbols dom, 
relational symbols in Sa, and no function symbols. 

We denote by § the set of all database schemas for a given (also infinite) set R. 
We denote by A® the empty database schema (where 7Ti {A% ) and ir 2 {A% ) are empty 
sets). A finite database schema A is composed by a finite set Sa, so that the set of 
all attributes of such a database is finite. 

- We consider a rule-based conjunctive query over a database schema A as an ex- 
pression q(x) < — i?„(u„), where n > 0, Ri are the relation names (at 
least one) in A or the built-in predicates (ex. <, =, etc.), q is a relation name not 
in A, iii are free tuples (i.e., may use either variables or constants). Recall that if 
V = (vi, .., v m ) then i?(v) is a shorthand for R(v±, .., v m ). Finally, each variable 
occurring in x must also occur at least once in Ui, ...,u„. Rule-based conjunctive 
queries (called rules) are composed by: a subexpression i?i(ui), R n (u n ), that 
is the body, and q(x) that is the head of this rule. If one can find values for the 
variables of the rule, such that the body holds (i.e. is logically satisfied), then one 
may deduce the head-fact. This concept is captured by a notion of "valuation". In 
the rest of this paper a deduced head-fact will be called "a resulting view of a query 
q(x) defined over a database A", and denoted by ||q(x)||. Recall that the conjunc- 
tive queries are monotonic and satisfiable. The Yes /No conjunctive queries are the 
rules with an empty head. 

- We consider that a mapping between two database schemas A and B is expressed 
by an union of "conjunctive queries with the same head". Such mappings are called 
"view-based mappings" and can be defined by a set Ai = {qAi(*i) => 9si(xj)|l < 
i < n}, where =>■ is the logic implication between these conjunctive queries (^(x,) 
and qAi (xj), over databases A and B respectively. 

We consider a view of an instance-database A an n-ary relation (set of tuples) ob- 
tained by a "select-project-join + union" (SPJRU) query q(x) (it is a term of SPJRU 
algebra) over A: if this query is a finite term of this algebra than it is called a "fini- 
tary view" (a finitary view can have also an infinite number of tuples). 

- An instance of a database A is given by A = (A, I a), where I a is an Tarski's FOL 
interpretation function, that satisfies all integrity constraints in Sa, and maps each 
relational symbol of Sa (n-ary predicate in FOL) into an n-ary relation a, G A 
(called also "element of A" ). Thus, a relational instance-database A is a set of n- 
ary relations, and they are managed by relational database systems (RDBMS). 
Given two autonomous instance-databases A and B, we can make a. federation of 
them, i.e., their disjoint union A(+J B, in order to be able to compute the queries 
with relations of both autonomous instance-databases. 

A federated database system is a type of meta-database management system (DBMS) 
which transparently integrates multiple autonomous database systems into a single 
federated database. The constituent databases are interconnected via a computer 
network, and may be geographically decentralized. Since the constituent database 
systems remain autonomous, a federated database system is a contrastable alterna- 
tive to the (sometimes daunting) task of merging together several disparate databases. 
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A federated database, or virtual database, is the fully-integrated, logical composite 
of all constituent databases in a federated database system. 

McLeod and Heimbigner |20l were among the first to define a federated database 
system, as one which "define[s] the architecture and interconnects] databases that 
minimize central authority yet support partial sharing and coordination among database 
systems". Among other surveys, Sheth and Larsen lETI define a Federated Database 
as a collection of cooperating component systems which are autonomous and are 
possibly heterogeneous. 

1.2 DB (Database) category 

Based on an observational point of view for relational databases, we may introduce a 
category DB lfT4l for instance-databases and view-based mappings between them, with 
the set of its objects ObnB, and the set of its morphisms Motdb, such that: 

1. Every object (denoted by A, B, C,..) of this category is a instance-database, com- 
posed by a set of n-ary relations en G A, i = 1, 2, ... called also "elements of A". 
We define a universal database instance T as the union of all database instances, 
i.e., T = {ai\ai E A,A £ Obos}- It is the top object of this category. 

We have that T = TT, because every view v G TT is an instance-database as well, 
thus v G T. Vice versa, every element r G T is a view of T as well, thus r G TT. 
Every object (instance-database) A has also the empty relation _L. The object com- 
posed by only this empty relation is denoted by _L° and we have that T_L° = _L° = 

Two objects A and B are isomorphic in DB, denoted by A ~ B, if T A = TB. 
For any instance-database A it holds that A C T A and A ~ T A. 
Any empty database (a database with only empty relations) is isomorphic to this 
bottom object _L°. 

2. Morphisms of this category are all possible mappings between instance-databases 
based on views, as they will be defined by formalism of operads in what follows. 

In what follows, the objects in DB (i.e., instance-databases) will be called simply databases 
as well, when it is clear from the context. Each atomic mapping (morphism) in DB be- 
tween two databases is generally composed of three components: the first correspond to 
conjunctive query ^ over a source database that defines this view-based mapping, the 
second (optional) Wi "translate" the obtained tuples from domain of the source database 
(for example in Italian) into terms of domain of the target database (for example in En- 
glish), and the last component vi defines which contribution of this mappings is given 
to the target relation, i.e., a kind of Global-or-Local-As- View (GLAV) mapping (sound, 
complete or exact). 

In what follows we will consider more simple case without the component um. 
We introduce also the two functions do,di such that 9o(9a 4 ) = {rn, ■■■,rik} (the set 
of relations used in the query formula qA i (x) and d\ (qA f ) — {fi}, with obtained view 
n = WqaMW- 

Thus, we can formally introduce a theory for view-mapings based on operads: 
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Definition 1. We define the following two types of basic mappings: 

— LOGIC-SENTENCE MAPPING: For any sentence, a logic formula ipi without free 
variables over a Database schema A, we can define a schema mapping ifi : 
A — > A0. The unique instance-database of the empty shema A0 is denoted by 
_L = {-L}, where _L denotes the empty relation. Consequently, for each interpreta- 
tion a it holds that a*(A^) =_L°. 

This kind of schema mappings will be used for the integrity constraints over database 
schemas, and Yes/No queries, as will be specified in Section\3\ 

- VIEW-MAPPING: For any query (a logic formula withfree variables) over a schema 
A we can define a schema map qi : A — > {?';}> where qi G 0(rn, ?*ifc,r.s), 
Q = (ni,...,rik) C A 

For a given a the correspondent view-map at instance level is qA t = <Z_l} • 

A — > TA, with _Lg A = a*(A) C TA), d (qj_) = dx(q ± ) = {_L}. For 
simplicity, in the rest of this paper we will drop the component q^ of a view-map, 
and assume implicitly such a component; thus, do(qAi) = o*{Q) C A and 
d\{qAi) = { a ( r i)} ^ TA is a singleton with the unique element equal to view 
obtained by a "select-project-join+union" term qi. 

Thus, we introduce an atomic morphism (mapping) between two databases as a set of 
simple view-mappings: 

Definition 2. Atomic morphism: Every schema mapping f Sc ,h ■ A — > B, based 
on a set of query-mappings qi, is defined for finite natural number N by 
fsch = {vi-qi\qi G 0(m,...,rik, r'i), Vi G 0{r' t ,n), 
{r a ,...,n k } Q A, n e B, 1 < i < N}. 

Its correspondent complete morphism at instance database level is 
f = a*(f S ch) = { = a(vi) ■ a(qi) | v { ■ qi G f Sc h} : A—y B, where: 
Each ct{qi) is a query computation, with obtained view a(^) G TAfor an instance- 
database A = a*(A) = I rk G A}, and B = a*(B). 

Let ir qi be a projection function on relations, for all attributes in d\(a(qi)) = {a(r")}. 
Then, each a(vi) : a(r^) — > a(rj) is one tuple-mapping function, used to distinguish 
sound and exact assumptions on the views, as follows: 

1. inclusion case, when a(r[) C ir q . (af(rj)). Then for any tuple t G a(r^), a{vi)(t) = 
t\, for some t\ G a(Vj) such that n qi ({ti}) = t. 

2. exact case, special inclusion case when a{r[) = n qi (a(ri)). 

We define \\qAi\\ — <^(f'i) the extension of data transmitted from an instance-database 
A into B by the component qA t - 

Notice that the components a(vi),a(qi) are not the morphisms in DB category: only 
their functional composition is an atomic morphism. Each atomic morphism is a com- 
plete morphism, that is, a set of view-mappings. Thus, each view-map : A — > TA, 
which is an atomic morphism, is a complete morphism (the case when B = TA, and 
a{vi) belongs to the "exact case"), and by c-arrow we denote the set of all complete 
morphisms. 
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Based on atomic morphisms (sets of view-mappings) which are complete arrows (c- 
arrows), we obtain that their composition generates tree-structures, which can be in- 
complete (p-arrows), in the way that for a composed arrow h = g o / : A — > C, of 
two atomic arrows / : A — > B and g : B — > C, we can have the situations where 
do(f) C do(h), where the set of relations in do(h) — 9o(/) C do(g) are denominated 
"hidden elements". 

Definition 3. The following BNF defines the set Mor]j B of all morphisms in DB: 
p — arrow := c — arrow \ c — arrow o c — arrow (for any two c-arrows f : A — > B 
and g : B — > C ) 

morphism := p — arrow \ c — arrow op — arrow {for any p-arrow f : A — > B 
and c-arrow g : B — > C ) 

whereby the composition of two arrows, f (partial) and g (complete), we obtain the 
following p-arrow (partial arrow) h = g o f : A — > C 

h = gof= (J {q B] } o 

q B] agU ao(g Bj .)f|'9i(/)#0 

° IJ {QA t (tree)} 

lAi& f & 9l(<jA i ) = {l'} & v£ d (q Bj ) 

= {q Bj o{q Az (tree) \ d^) C d (q Bj )} I q Bj & g & d (q Bj ) HMf) + 0} 
= {q B] {tree) \ q B] G g & d (q Bj ) R h (f) + 0} 
where q Ai (tree) is the tree of the morphisms f below q Ai - 

We define the semantics of mappings by function B? : Motd B — 5- Ob]j B , which, 
given any mapping morphism f : A — > B returns with the set of views ("information 
flux") which are really "transmitted" from the source to the target object. 

1. for atomic morphism, f = B T (f) = T{||/j|| | ,f £ /}. 

2. Let g : A — > B be a morphism with a flux g, and f : B — > C an atomic morphism 
with flux f defined in point 1, then f o g = Bx{.f ° g) — f C\ 9- 

We introduce an equivalence relation over morphisms by, f w g iff f = 'g. 

Notice that between any two databases A and B there is at least an "empty" arrow 
: A -> B such that <9 O (0) = <9i(0) = = {_L} = ±°. 

Basic properties of this database category DB as its symmetry (bijective correspondence 
between arrows and objects, duality (DB is equal to its dual DB OP ) so that each limit 
is also colimit (ex. product is also coproduct, pullback is also pushout, empty database 
_L° is zero objet, that is, both initial and terminal object, etc.), and that it is a 2-category 
has been demonstrated in 11 11141 . 

Generally, database mappings are not simply programs from values (relations) into 
computations (views) but an equivalence of computations: because of that each map- 
ping, from any two databases A and B, is symmetric and gives a duality property to the 
category DB. The denotational semantics of database mappings is given by morphisms 
of the Kleisli category DBt which may be "internalized" in DB category as "computa- 
tions" mi. 

The product A x B of a databases A and B is equal to their coproduct A + B, and the 
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semantics for them is that we are not able to define a view by using relations of both 
databases, that is, these two databases have independent DBMS for query evaluation. 
For example, the creation of exact copy of a database A in another DB server corre- 
sponds to the database A + A. 

The duality property for products and coproducts are given by the following commuta- 
tive diagram: 



A A 




B B 



In the paper II15I16I15I have been considered some relationships of DB and standard 
Set category, and has been introduced the categorial (functors) semantics for two basic 
database operations: matching ®, and merging ©, such that for any two databases A 
and B, we have that A <g> B = TA f| TB and A © B = T(A (J B). In the same work 
has been defined the algebraic database lattice and has been shown that DB is concrete, 
small and locally finitely presentable (lfp) category. Moreover, it was shown that DB 
is also V-category enriched over itself, was developed a metric space and a subobject 
classifier for this category, and demonstrated that it is a weak monoidal topos. 
In this paper we will develop a functorial semantics for the database schema mapping 
system, based on the theory of sketches. 

The plan of this paper is the following: In Section 2 we will present the two ba- 
sic operations for database schemas used in database mapping systems (separation and 
federation), and we will explain why the functorial semantics for database mappings 
needed a new base category instead of common Set category. 

Finally, in Section 3 is presented a definition of the graph G for a schema database 
mapping system, and the definition of its sketch category Sch(G). Based on this frame- 
work then is presented functorial semantics for database mapping systems with the base 
category DB. 

2 Basic database schema operations: Separation and Federation 

For the composition of complex database mapping graphs, it is important to distinguish 
two basic compositions of two database schemas A and B with respect to DBMSs: 

- the case when in this composed schema, the two database schemas are mutually 
separated by two independent DBMSs, in order that it is impossible to write a query 
over this composition with relations of both databases: it is common case when 
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two databases are separated, and this symmetric binary separation-composition at 
schema level will be denoted by A f B, such that iTi(A \B) = TTi(A) 1+J TTi(B), i = 
1,2. 

- the case when in this composed schema, the two database schemas are connected 
into the same DBMS (without any change of the two original database schemas): In 
this case we are able to use the queries over this composed schema with relations of 
both databases for inter database mappings, and this symmetric binary federation- 
composition at schema level will be denoted by A (J) B, such that 7T; (AQ)B) = 

m(A)\i)n(B),i = 1,2. 

The identity = for database schemas is naturally defined by: for any two A,B £ §, 
A = B if iTi(A) = TTi(B), i = 1,2. Notice that both symmetric binary operators, f, (J), 
for database schemas in § are associative with identity element ^0) (nullary operator), 
so that the algebraic structures ((§, =), f, A®) and ((§, =), (J), ^0) are the monoids. 
Let us consider, for example, a mapping Ai : A\B — > C, and a mapping Ai : A (J) B — > 
C. In the first case in any query mapping q(x) => gc( x ) £ M the all relation symbols 
in the query q(x) must be of database A or (mutually exclusive) of database B, and this 
mapping can be equivalently represent by the graph: 



A B 




C 



where .MaI+J -Mb = M., while in the case of mapping M : AQ)B —> C such an 
decomposition is not possible, because we can have a query mapping q(x) => qc( x ) £ 
A4 with relation symbols from both databases A and B. 

If we introduce the mappings M.\ = {rAi{xi) rAi(xi)\rAi <E .4} and A^2 = 
{^Si(yi) => fBi{yi)\rBt & B}, then we obtain the mapping graph, 



A 




B 



C 



that can be seen as a cocone diagram for schema database mappings. 
Let us consider another dual example, a mapping A4 : C — s- A f B. In this case in 
any query mapping <7c(x) => q(x) <E Ai the all relation symbols in the query q(x) 
must be of database A or (mutually exclusive) of database B, and this mapping can be 
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equivalently represent by the graph: 



A B 




C 



where Ma l+J Mb = M. 

If we again introduce the mappings Mi = {?'/u(xi) => rAi{x-i)\rAi & A} and M2 = 
{rsi(yi) => r Bl (y t )\rBi G B}, then we obtain dual mapping graph, 



A 




B 



C 



that can be seen as a cone diagram for schema database mappings. 

Based on these two simple examples, generally, the schema database mappings can 
be expressed by using the small sketches. The detailed presentation of sketches for the 
database mappings, and their functorial semantics will be given in Section[3] 
Sketches are developed by Ehresmann's school, especially by R.Guitartand and C.Lair 
1 22 23 24 1 . Sketch is a category together with distinguished class of cones and cocones. 
A model of the sketch is a set-valued functor turning all distinguished cones into limit 
cones, and all distinguished cocones into colimit cocones, in the category Set of sets. 
There is an elementary and basic connection between sketches and logic. Given any 
sketch, one can consider the underlying graph of the sketch as a (many-sorted) lan- 
guage, and one can write down axioms in the Loo oo -logic (the infinitary FOL with 
finite quantifiers) over this language, so that the models of the axioms become exactly 
the models of the sketch. 

the category of models of a given sketch has as objects the models, and arrows all natural 
transformations between the models as functors. A category is sketchable (esquissable) 
or accessible iff it is equivalent to the category of set-valued models of a small sketch. 
Recall that a graph G consists of a set of vertices denoted Go and a set of arrows de- 
noted G\ together with the operators dom, cod : G\ —> Gq which assigns to each arrow 
its source and target. (Co)cones and diagrams are defined for graphs in exactly the same 
way as they are for categories, but commutative co(cones) and diagrams of course make 
no sense for graphs. 

By a sketch me mean 4-tuple (G, u, D, G) where G is a graph, u : Go — >• Gi is a func- 
tion which takes each vertex (node) A in Go to an arrow from A to A, D is a class of 
diagrams in G and G is class of (co)cones in G. Each (c)cone in G goes (to)from some 
vertex (from)to some diagram; that diagram need not be in D, in fact it is necessary to 
allow diagrams which are not in D as bases of (co)cones. 
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Notice that, differently from the work dedicated to categorical semantics of Entity- 
Relationship internal relational database models, where nodes of sketches are single re- 
lations, here at higher level of abstraction, the nodes are whole databases: consequently 
in a such framework we do not use commutative database mapping systems, thus D is 
empty set. In fact, in the database mapping system, the (co)cone diagrams above will 
never be used in practical representation of database mapping systems: instead will be 
alternatively used only its selfconsistent parts, as first diagram above, or, equivalently, 
a single arrow M : A f B — > C. 

But, clearly, for the introduced dataschema composition operator f, with cone and co- 
cone diagrams above has to be present in C for our sketches. 

Consequently we obtain the following fundamental lemma for the categorial modelling 
of database mappings: 

Lemma 1. The Set can not be used as the base category for the models of database- 
mapping sketches. 

Proof: Let E be a sketch for a given database sketch (G, u, D, C), where C is a set of 
(co)cones of the two diagrams introduced for the database schema composition operator 
f, and a model of this sketch be a functor F : E — > B, where B is a base category. Then 
all cones in C has to be functorially translated into limit commutative diagrams in B, 
and all cocones in C has to be functorially translated into limit commutative diagrams 
in B: i.e., the cocone in the figure above has to be translated into coproduct diagram, 
and cone into product diagram in B. 

Consequently, the object F(A f B) has to be both the product A x B and coproduct 
A + B, where A — F(A) and B = F(B) are two objects in B and x , + the product 
and coproduct operators in B, but it can not be done in Set. In fact, the product A x B 
in Set is the cartesian product of these two sets A and B, while the coproduct A + B is 
the disjoint union, so that does not hold the isomorphism Ax B ~ A + B in Set. 
□ 

Remark: The fundamental consequence of this lemma is that we needed to define a new 
base category for the categorial semantics of database mappings. In fact, we defined 
this new base category B, denoted by DB (DataBase) category, and we have shown 
that it satisfies the duality property where the product and coproduct diagrams are dual 
diagrams, so that for any two objects (instance databases) in DB the objects Ax B and 
A + B are equal (up to isomorphism). 

2.1 Data Separation: (Co)product operator in DB 

Separation-composition of objects are coproducts (and products) in DB category: 

Definition 4. The disjoint union of any two instance-databases (objects) A and B, de- 
noted by A + B, corresponds to two mutually isolated databases, where two database 
management systems are completely disjoint, so that it is impossible to compute the 
queries with the relations from both databases. 
The disjoint property for mappings is represented by facts that 
do(f + 9) ± do(f) + dob), di(f + g) 4 &(/) + d^g). 
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Thus, for any database A, the replication of this database (over different DB servers) 
can be denoted by the coproduct object A + A in this category DB. 

Proposition 1 For any two databases (objects) A and B we have that T(A + B) = 
T A + TB. Consequently A + A is not isomorphic to A. 

Proof: We have that T(A + B) = TA + TB, directly from the fact that we are able to 
define views only over relations in A or, alternatively, over relations in B. Analogously 
f + g = J + g, which is a closed object, that is, holds that T(f + g) = T(f + g) = 
Tf + Tg = f + g = fTg. 

From T(A + A) = TA + TA ± TA we obtain that A + A is not isomorphic to A. 

□ 

Notice that for coproducts holds that C+ JL° = _L° +C ~ C, and for any arrow 
/inDB, /+ -L 1 « -L 1 +/ « /, where _L X is a banal empty morphism between 
objects, such that d (± 1 ) = <9i(-L x ) =-L°, with T 1 =_L°. 

We are ready now to introduce the duality property between coproducts and products in 
this DB category: 

Proposition 2 There exists an idempotent coproduct bifunctor + : DB x DB — > DB 
which is a disjoint union operator for objects and arrows in DB. 

The category DB is cocartesian with initial (zero) object _L° and for every pair of 
objects A,B it has a categorial coproduct A + B with monomorphisms (injections) 
in a : A ^ A + B and in B ■ B ^ A + B. 

By duality property we have that DB is also cartesian category with a zero object _L°. 
For each pair of objects A, B there exists a categorial product AxB with epimorphisms 
(projections) pa = "in^ : A x A -» A and ps = "ing P : B x B -» B, where the 
product bifunctor is equal to the coproduct bifunctor, i.e., x = +. 

Proof: 1 . For any identity arrow (id a , ids ) in DB x DB, where id a, idb are the identity 

arrows of A and B respectively, holds that id a + ids = id a + ids = TA + TB = 

T(A + B) = idA+B- Thus, + 1 (id J 4, ids) = idA + ids = idA+B, is an identity arrow 
of the object A + B. 

2. For any given k : A — ► A u k\ : A x — > A 2 , I : B — > B lt h : B 1 — ► B 2 , holds 

+ i (*i,ii)^+^(* > o = + i (*i.ii)n+ i (*.o = *i °kTii ° i = o k, h o i) 

= + 1 ((h,k)o(l 1 ,l)), thus + 1 (ki,h) o+\k,l) = + 1 ((ki,k) o 

3. Let us demonstrate the coproduct property of this bifunctor: for any two arrows / : 
A — > C, g : B — > C, there exists a unique arrow k : A + B — > C, such that 
f = k o in a, g = k o ins, where in a '■ A A + B, iub '■ B >• A + B are the 
injection (point to point) monomorphisms (iriA = TA, ins = TB). 

It is easy to verify that for any two arrows / : A — > C, g : B — > C, there is exactly 
one arrow k = ec°(f + g) '■ A + B — > C, where ec ■ C + C -» C is an epimorphism 
(with eZ< = TC), such that k = f + g. 
□ 
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2.2 Data Federation operator in DB 

The opposite operation to (co)product (a DBMS's separation) is the DBMS's Data fed- 
eration of two database instances A and B. 

A federated database system is a type of meta-database management system (DBMS) 
which transparently integrates multiple autonomous database systems into a single 
federated database. The constituent databases are interconnected via a computer net- 
work, and may be geographically decentralized. Since the constituent database systems 
remain autonomous, a federated database system is a contrastable alternative to the 
(sometimes daunting) task of merging together several disparate databases. A federated 
database, or virtual database, is the fully-integrated, logical composite of all constituent 
databases in a federated database system. 

In this way we are able to compute the queries with the relations of both databases. In 
fact, Data Federation technology is just used for such an integration of two previously 
separated databases. 

Consequently, given any two databases (objects in DB) A and B, the federation of 
them (under the common DBMS) corresponds to disjoint union of them under the same 
DBMS, thus, equal to database A[*)B. 

3 Categorial semantics of database schema mappings 

It is natural for the database schema A = (Sa, £a), where Sa is a set of n-ary rela- 
tion symbols and Sa are the database integrity constraints, to take Sa to be a tuple- 
generating dependency (tgd) and equality-generating dependency (egd). We denote by 
A0 the empty database schema with empty set of relation symbols, where £a is the 
empty set of integrity constraints. 

A tgd says that if some tuples, satisfying certain equalities exist in the relation, then 
some other tuples (possibly with some unknown values), must also exist in the relation. 
An egd says that if some tuples, satisfying certain equalities exist in the relation, then 
some values in these tuples must be equal. Functional dependencies are egd's of a spe- 
cial form, as for example primary-key integrity constraints. 

These two classes of dependencies together comprise the embedded implication de- 
pendencies (EID) |25l which seem to include essentially all of the naturally-occuring 
constraints on relational databases (the bolded variables x, y denotes a nonempty list of 
variables): 

1 . a tuple-generating dependency ( tgd) of the FOL form 

Vx(3y0 A (x,y) => 3z (V>a(x, z)) 
where the formulae <f>A (x, y) and i/'a(x, z) are conjunctions of atomic formulas over 
A (for integrity constraints over database schemas we will consider only class of 
weakly-full tgd for which query answering is decidable, i.e., when the right-hand 
side has no existentially quantified variables, and if each yi E y appears at most 
once in the left side). 

2. an equality-generating dependency (egd): Vx ((/^(x) =>■ (xi = X2)) 
where a formula <\>a (x) is a conjunction of atomic formulas over A, and Xi , X2 are 
among the variables in x. 
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Notice that any schema database mapping from a schema A into a schema B is rep- 
resented by the general tgd Vx (3y <^4(x,y) => 3z (^s(x,z)), that is by the view 
mapping <?a(x) => <7b(x), as will be used in what follows in Definition[7] where <7a(x) 
(equivalent to (3y a (x, y)), is a query over the schema A, and gs(x) (equivalent to 
(3z ^b(x, y)), is a query over the schema B. 

In what follows we will explain how the logical model theory for database schemas and 
their mappings based on views, can be translated into the category theory by using the 
DB category defined in the previous chapter. The integrity constraints for databases are 
expressed by the FOL logical sentences (the FOL formulae without free variables), and 
such sentences are expressed in the schema database level by the mappings from the 
database schema A into the empty database schema A®. We define their denotation in 
the DB category as follows: 

Definition 5. For any sentence <p : A — > A0 ( a logic formula without variables, in 
Definition [7} over a database schema A and a given interpretation a such that tp is 
satisfied by it, then there exists the unique morphism from A into terminal object _L° in 
DB category, f : A — >_L°, where f = a*(ip) and A = a* (A) (for A ~_L° as well). 
Otherwise, when A = a* (A) is not isomorphic to _L°, ip is not satisfied by a then 
a* {tp) is mapped into the identity arrow id±o :A_°— >J_°. 

Notice that, differently from view-mappings for queries (formulae with free variables) 
given in Definition^ the integrity constraints have in a DB category the empty database 
(zero object, i.e., terminal an initial) as the codomain, and the information flux equal 
to _L = {J-}. It is consistent with definition of morphisms in DB category, because the 
sentences do not transfer any data from source to target database, so, their information 
flux has to be empty. In the case of ordinary query mappings, the minimal information 
flux is {±} =_L° as well. In DB category, for any unique morphism from initial object 
_L° (empty database) to another object (database) A, f :± ^ A, the information flux 
of these morphisms is also equal to _L°. 

Based on this semantics for logic formulae without free variables (integrity constraints 
and Yes/No queries), we are able to define the categorial interpretations for database 
schema mappings, as follows. 

3.1 Categorial semantics of database schemas 

As we explained in Section [2] in order to define the database mapping systems we will 
use two fundamental operators for the database schemas, data federation (J) and data 
separation f, with the two correspondent monoids, ((§, =), f, ^0) and ((§, =), 0, -40), 
and with the distribution low: A 0(£ \ C) = {A® B) f (-40 C). 
Consequently, each vertex in a graph G of a database mapping system, is a term of the 
combined algebra of these two monoids, §Aig = ((§, =), 0, t> As)- 
In what follows, we say the database schema for any well formed term (i.e., an algebraic 
expression) of this algebra for schemas E>Aig, and we denote by A £ §Aig a database 
schema that can be an atomic schema, or composed schema by a finite number of atomic 
schemas and two algebraic operators and data separation f of the algebra E>Ai g - 
Consequently for each schema A G §Aig, we have that A = (Sa, ZJa), where Sa = 
1+JISb; I Bi is an atomic schema in the schema expression .4.} and Sa — 1+1 {^Bi \ Bi 
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is an atomic schema in the schema expression A}. 

For each atomic schema database and an interpretation a, we have that A = a* (A) 
is an instance-database of this schema, thus, it is an object in DB category. For the 
composite schemas (the non atomic terms of the algebra &Alg their interpretation in DB 
category is given by the following Proposition: 

Proposition 3 For a given interpretation a the following homomorphismfrom schema 
database level and instance database level there exists: 
a* : (($, =), 0, f, A%) -> {{Ob DB , ~), y, +, J_°). 

Proof: The interpretation of a given schema A is an instance A = a* (A) of this 
database, that is an object in DB, while for every interpretation a*(A%) =_L°. 
From the monoidal property we have the equation AQ) A® = A in the algebra S>Ai g - 
By the homomorphism above we have that a*(=) = ~,a*(®) = 1+J, so that 
a*(A($A®) = a*(A)\Sa*(A®) = A[+J _L°~ A. From the monoidal property we 
have the equation A f A$ = A in the algebra §Aig- By the homomorphism above we 
have that a*(f) = +, so that a*(A\ A$) = a* (A) + a* {Am) = A+ 1°, and for 
coproducts in DB it holds the isomorphism A+ _L°~ A. 
□ 

Let A = (Sa, £a) be the database schema, where Sa is a set of relation symbols with 
a given list of attributes and Sa = 1J S e Jf d = TT2 (A) are the database integrity 
constraints (set of EIDs) which can be empty st as well. 

We can represent it by a sketch schema mapping <f>A ■ A — > A® (4>A denotes the sen- 
tence obtained by conjunction of all formulae in Sa), where A0 is the empty schema, 
such that for any interpretation a it holds that a(A0 ) =_L°. 

Proposition 4 If for a database schema A = (Sa, Sa) there exists a model A which 
satisfies all integrity constraints Sa = S^ d (J S e ^ d ( tpA will denote the sentence ob- 
tained by conjunction of all formulae in Sa), then there exists the following interpre- 
tation R-algebra a and its extension, the functor a* : Sch(G) — > DB, where Sch(G) 
is the sketch category derived from the graph G with the arrow Sa ■ A — > A0 (i.e., 
Sch(G) is composed by the objects A, A0, the arrow (f>A '■ A — > A0 and the identity 
arrows id a ■ A — > A and idA '■ A0 — > A0 ), such that: 

1. a* (A) = A, where A is possibly empty database instance ( with all empty relations) 
as well 

2. a*(idjC) = id a ■ A — > A 

3. a*(id A0 ) =id ± o :_L°— 

4. a*(4> A ) = (ft g d\Jfegd)-A^±°. 

Proof: from Definition|5]and the point 4 of this proposition we have that each integrity 
constraint g, G Sa of the database schema A is satisfied by the interpretation a (be- 
cause the conjunction of all integrity constraints, denoted by <$>a, is satisfied w.r.t the 
Definition |5): if Sa is empty, then it is always satisfied as usual. Thus a is a model 
of a database schema A, and the instance of this model is the nonempty database 
A = a* (A), that is an object in the DB category. 
□ 
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Notice that any empty database A (such that all its relations are empty) is isomorphic 
to the database _L° with only one empty relation _L (i.e., _L°= {_!_}). It is easy to show, 
based on the fact that any arrow for this empty database / : A — > A has the information 
flux / =_L°, so that / = id a is the unique identity arrow for this empty database. But 
the unique arrows g : A — >±.° and h > A have the same information fluxes, i.e, 
g = h =_L°, so that g o h = id±o and ho g = id a, and, consequently, A ~_L°. 
Consequently, the remark in the point 1 of this Proposition specifies that if A = a* (A) ~_L° 
is empty database, than a* is a model of a schema A, and integrity constraint for point 
4 corresponds to the satisfaction of this integrity constraint w.r.t. the Definition|5] 

3.2 Categorial semantics of database mappings 

First of all, we will define formally a schema database mapping graph G, as follows: 

Definition 6. A schema database mapping graph G is composed by an atomic arrow 
Sa '■ A — > Aq, for each database schema A, and a number of (a view based) atomic 
schema database mappings A4 : A — » B between two given schemas A and B. 
we will use the following basic binary operators for these database mapping graphs: 

- Given two mappings M.\ : A — > B and M.2 '■ B — >• C, we will denote their 
sequential composition in this graph G by M.2\M.\, where ; is a binary associative 
but non commutative operator. 

— Given two mappings M.\ : A — > B and M.% : A —> C, we will denote their 
branching in this graph G by A^l+J-^i : A — > B\C, where (+J is a binary 
associative and commutative operator of disjoint union. 

This is easy to verify that a graph G can be extended into a sketch category Sch(G). 
The semantics of a view-based mapping M = {qAi(^i) => <tej(Xj)|l < i < n) from 
a relational database schema A into a database schema B, is a constraint on the pairs 
of interpretations, of A and B, and therefore specifies which pairs of interpretations 
can co-exist, given the mapping (see also |1|). The formalization of the embedding 
7 : G —> Sch(G) of a graph G into the sketch Sch(G) can be given by iteration of the 
following rules: 

Definition 7. We consider the view-based mappings between schemas defined in the 
SQL language of SPJRU algebra. The arrows in the sketch Sch{G), for any arrow 
A4 : A — >• B in a given graph G in Definition® where M. = {qAi(*i) <lBi(xi)\l < 
i < n} and qAi{Xi), qBifai) are open FOL formulae over A, are defined as follows: 

1. for each qAi{xi) => QBi(Xi) £ M. such that that qsi is not a relation symbol of 
a database schema B, we introduce a new relation r^JCj) in "/(B) (we will use the 
same symbol for this "/-enlarged database schema by these new relations). Then we 
introduce in Sch(G) the single mapping arrow j 'm : A — > B, where, 

fM = l{M) = |J {v vqi :A — >B\ 9 (?i) = Ri, dtiqi) = d (O, = {n}} 

l<i<n 
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where qi,Vi are abstract "operations" (operads) introduced in Definition^ such 
that for a given model a of this database schema mapping, a(qi) is a query com- 
putation of a query qAi{xi). The set Ri is the set of relation symbols in A used in 
the formula qAi{xi). 

2. for each qAi(Xi) qBi(Xi) £ M such that qsi is not a relation symbol of a 
database schema B (then qAi{xi) =^ qBifai) (logical implication between queries) 
means that each tuple of the view obtained by the query qAifai) is also a tuple of 
the view obtained by the query qBi(xi)), we do as follows: 

We introduce in this sketch Sch(G) a new helper database schema Ci with a single 
relation Ci{xi, y), and two new schema mappings: 
!ac, = Wi ■ qi : A-> Ci (with d (wi) = <9i(<?i), di(wi) = {cj ), and 
fBC, =w' i -q' i :B (with d (qi) = &M) = 3>M)> W) = {d}. 

where R[ is the set of all relation symbols in B used in the formula qBi{Xi)), 
such that j Ad corresponds to {qAi(xi) => Ci(xi, \\a)} and fsd corresponds to 
{qBi(xi) => Ci(x%, tjs)}, where \a-, \b two new values not present in the do- 
minium of the databases. Consequently, we introduce in G also the integrity con- 
straint arrow tpi : Ci — > A0 for this new schema Ci, where the sentence ifi is equal 
to the tgd Vxi (3y(ci(xi,y) Ay = tu) =>■ 3z(cj(xj, z) A z = \\ B ))- 

It is easy to verify that in the obtained sketch Sch(G), between given any two nodes 
there is at maximum one arrow. There is a fundamental functorial interpretation con- 
nection from schema mappings and their models in the instance level category DB: 
based on the Lawvere categorial theories [26.27 j, where he introduced a way of de- 
scribing algebraic structures using categories for theories, functors (into base category 
Set, which we will substitute by more adequate category DB), and natural transforma- 
tions for morphisms between models. 

For example, Lawvere's seminal observation that the theory of groups is a category with 
group object, that group in Set is a product preserving functor, and that a morphism of 
groups is a natural transformation of functors, is an original new idea that was succes- 
sively extended in order to define the categorial semantics for different algebraic and 
logic theories. 

This work is based on the theory of sketches, which are fundamentally small categories 
obtained from graphs enriched by concepts such as (co)cones mapped by functors in 
(co)limits of the base category Set. It was demonstrated that, for every sentence in 
basic logic, there is a sketch with the same category of models, and vice versa (28]. 
Accordingly, sketches are called graph-based logic and provide very clear and intuitive 
specification of computational data and activities. For any small sketch E the category 
of models Mod(E) is an accessible category by Lair's theorem and reflexive subcat- 
egory of Set E by Ehresmann-Kennison theorem. A generalization to base categories 
other than Set was proved by Freyd and Kelly (1972). In what follows we will substi- 
tute the base category Set by this new database category DB. 

For instance, for the separation-composition mapping cocone diagram (graph G), given 
in the introduction, its translation in a sketch (a category Sch(G)) is presented in the left 
commutative diagram below (notice that mapping arrow M. in a graph G ar replaced 
by the morphism Jm in this sketch, while the nodes (objects) are changed eventually 
by introducing another auxiliary relation symbols as explained in Definition[7]i, and the 
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functorial translation of this sketch into DB category has to be coproduct diagram in 
DB as follows: 

J{A ) J^i 7 (^) t7 ( B )^ 7 (S) A ^4 A + B B 




7 (C) C 

As we explained in the introduction, in database mapping systems, expressed by a graph 
G, we will never use "commutative diagrams" as left diagram above (but only an arrow 
Im '■ l(A) f ~f(B) — > 7(C), or, more frequently, two simple arrows Jm a = li-M-A) '■ 
7(71) 7(C) and f Ma = l{M B ) ■ l{B) -> 7(C)), our sketch E = Sch(G) will 
be a simple small category, i.e., 4-tuple (G, u, D, C) where D and C are empty sets. 
Consequently, these database-mapping sketches are more simple than the sketches used 
for definition of Entity-Relationship models of single relational databases. 

Propositions Let Sch(G) be a schema sketch category generated from a schema 
mapping graph G, obtained by applying method in Definition \7[for each mapping be- 
tween two database schemas in a given database mapping system with n > 2 database 
schemas. Let an interpretation R-algebra a satisfies the following property: for any 
database schema A, (object in Sch(G)), a satisfies the Proposition^ so that A = 
a* (A) 6 Ob ob is a model of the database schema A, and for each schema mapping 
arrow fsch ■ A — > B, (where B is not empty schema) the atomic morphism in DB 
category a* (fsch) ■ ce* (A) —> a* (£>) is determined by banal set-inclusion case of Def- 
inition^ 

Then there is the functor (categorial model) a* : Sch(G) — > DB . The set of 
categorial models of the database schema mapping graph G is equal to the homset 
hom(Sch(G),DB) of all functors from these two categories in the category Cat, i.e. 
equal to the set of all objects in the category of functors DB Sch ( G ^ as well. 
For a given model (functor) a* G DB Sch ( G \ its image inDB will be calleda DB-mapping 
system, and denoted by M.s- 

Proof: This is easy to verify, based on general theory for sketches ll27ll : each arrow in a 
sketch (obtained from a schema mapping graph G) may be converted into a tree syntax 
structure of some morphism in DB (labeled tree without any interpretation). The functor 
a* is only the simple extension of the interpretation R-algebra function a for a lists 
of symbols. The functorial property for the identity mappings follows from Proposition 
[4] and for two atomic mappings fs c h ■ A — > B, gsch ■ B — > C, and their atomic 
morphisms in DB, / = a* (f Sc h), 9 = a* (gsch), we have that a* (g Sc h ° fsch) =9°f 
as defined in Definition [3] It remains only to verify that for each auxiliary database 
schema d and integrity constraint ipi : C% — > A% (in Definition [7]) the operator a* 
satisfies the functorial property, such that this schema arrow is mapped into the arrow 
a*((fii) : Ci — where C; = a*(C,). In fact it holds, because if a is a model of this 
database mapping system represented by the graph G, then this integrity constraint <pi 
is satisfied, and based on the Definition [5] the functorial property is satisfied. 
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Notice that this is true also in special cases when d = a*(Cj) ~ _L°. This case 
happens when for a view mapping ?a(x) => qs (x), both [ | (x) 1 1 (resulting view of 
the query <?a(x) over the database A = a* (.4)), and ||qs(x)|| (resulting view of the 
query gs(x) over the database B = a*(B)), are empty relations, and consequently 
a(ci(x, y)) is empty relation in the database instance G = a*(d) = {a(cj(x, y))}, so 
that Q ot±°. 

Thus an integrity constraint tp i : d — > A$ (an auxiliary arrow in Sch(G) obtained 
from some mapping between two database schemas in G) can be unsatisfied only if 
Ci = a*(Ci) is not isomorphic to _L°. 

In order to prove that the set of functors in DB Sch( - G - ) is exactly the set of al models of 
the database mapping system expressed by the graph G, it is now enough to prove that 
any interpretation a that is not a model of G, then can not be a functor from Sch(G) into 
DB. In order that a given a is not a model of G, must be satisfied one of the following 

cases: 

1. case when for some database schema A in G, a* (A) is not a model of this database: 
it means that the conjunction of all integrity constraints Sa : — > ^4.0 of is not 
satisfied by a (thus when a* (A) not isomorphic to _L°, as specified by Definition[5]i, so 
that from Definition [5] it holds that a*{E a ) = id±o :_L°— >-_L°, and it does not satisfy 
the functorial requirement because database instance a* (A) is not isomorphic to _L°. 

2. case when some integrity constraint cpi : d — > A® (an auxiliary arrow in Sch(G) 
obtained from some mapping between two database schemas in G), with G, = a*(Ci) 
is not isomorphic to _L°, is not satisfied by a, so that from Definition it holds that 
a*((fii) = id±o s>_L°, and it does not satisfy the functorial requirement (because 
database instance a*{Ci) is not isomorphic to _L°). 

□ 

Notice that in this functorial semantics for database mappings from an original schema 
database mapping M. : A — > B, with a correspondent arrow Jm = ~f(A4) : "/(A) 
7(B), where 7(-4), 7(£>) are the original database schemas enlarged by a number of 
auxiliary relations introduced in Definition [7] can be translated into the arrow / = 
a*(j(Ai)) : a* (A) — > a*{B) between the instances of the original schema databases 
A and B without added auxiliary relations, because we have that a* (A) , a* (A) are 
isomorphic in DB to a*(j(A)), a*(-f(A)), respectively, as follows: 

Proposition 6 For any database schema A and B, in a given schema database mapping 
graph G, it holds that A = a* (A) ~ a*(-f(Aj). 

Consequently, any functorial semantics of a given schema database mapping M : A — > 
B is represented in the DB category by the morphism, f w a*(~f(M)) : a* (A) — > 
a*{B). 

Proof: It is easy to show that T(a*(A)) = T(a*(j(A))), because each 7-added rela- 
tion ri(x.i) is just a subrelation of the view obtained by the query qs i (x^) over the rela- 
tions in the original database B, that is part of the view mapping <j J 4 i (x, ) => (x;) G 
M : A^t B (see point 2 in Definition [7]). 

Consequently, we have the isomorphisms isA ■ a*{A) ~ a*(7(„4)), iss ■ a*(B) ~ 
a*(7(B)), so that, / = isg 1 o a*(j(M)) o is A : A ->• B, i.e., / w a*(j(M)). 
□ 
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That is the reason that instead of 7 (.4) we can use the original database schemas A and 
their database instances A = a* (A) in the DB category. 

4 Conclusions 

In previous work we defined a base database category DB where objects are instance- 
databases and morphisms between them are extensional GLAV mappings between databases. 
We defined equivalent (categorically isomorphic) objects (database instances) from the 
behavioral point of view based on observations: each arrow (morphism) is composed 
by a number of "queries" (view-maps), and each query may be seen as an observation 
over some database instance (object of DB). Thus, we characterized each object in DB 
(a database instance) by its behavior according to a given set of observations. In this 
way two databases A and B are equivalent (bisimilar) if they have the same set of its 
observable internal states, i.e. when TA is equal to TB. It has been shown that such a 
DB category is equal to its dual, it is symmetric in the way that the semantics of each 
morphism is an closed object (database) and viceversa each database can be represented 
by its identity morphism, so that DB is a 2-category. 

In JT3 15.) has been introduced the categorial (functors) semantics for two basic database 
operations: matching and merging (and data federation), and has been defined the alge- 
braic database lattice. 

Here we considered the schema level for databases and their view-based mappings, 
based on queries. The fundamental operations for databases in the view of inter-mappings 
between them is the fact if they are separated od federated databases. It depends on the 
kind of DBMS system used for two mapped databases: when two databases are feder- 
ated then we can compute the queries over the relations of both databases; when they 
are separated by two independent DBMS, then DBMS can compute only the queries 
with all relations of only one of these two databases. 

We have shown that these two fundamental operators, data separation and data feder- 
ation, used in schema database mapping system, need a different base category from 
Set where coproducts are equal to products (up to isomorphism). Then we defined the 
Graphs schema database mapping systems, and the sketches for such database graphs. 
Consequently we defined the categorical functorial semantics for these sketches into 
new base database category DB. 
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